GRAPHICAL REWRITD^G SYSTEM FOR MULTIMEDIA DESCRIPTIONS 

RELATED APPLICATIONS 
[0001] This application claims the benefit of U.S. provisional applications 
serial numbers 60/242,2^ filed on October 20, 2000, 60/261,100 filed on January 10, 
2001, 60/280,608 and 60/28«L703 both filed on March 29, 2001, and 60/305,008 filed on 
July 11, 2001, and which are herein incorporated by reference. 

FIELD OF THE INVENTION 
[0002] This invention relates generally to descriptions for multimedia content, 
and more particularly to graphical representation of such descriptions. 

COPYRIGHT NOTICE/PERMISSION 
[0003] A portion of the disclosure of this patent document contains material 
which is subject to copyright protection. The copyright owner has no objection to the 
facsimile reproduction by anyone of the patent document or the patent disclosure as it 
appears in the Patent and Trademark Office patent file or records, but otherwise reserves 
all copyright rights whatsoever. The following notice applies to the software and data as 
described below and in the drawings hereto: Copyright © 2000, Sony Electronics, Inc., 
All Rights Reserved. 

BACKGROUND OF THE INVENTION 
[0004] Digital multimedia content is becoming widely distributed through 
broadcast transmission, such as digital television signals, and interactive transmission. 
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such as the Internet. The content may be still images, audio feeds, or video data streams. 
However, the enthusiasm for developing multimedia content has led to increasing 
difficulties in managing, accessing and identifying such a large volume of content. 
Furthermore, complexity and a lack of adequate indexing standards are problematic. 

[0005] The Moving Picture Experts Group (MPEG) has promulgated a 
Multimedia Content Description Interface, conmionly referred to as MPEG-7, to 
standardize the description of multimedia content when it is transmitted from a system 
that generates the content to a system that presents the content to a user. In contrast to 
preceding MPEG standards such as MPEG-1 and MPEG-2, which relate to coded 
representation of audio-visual content, MPEG-7 is directed toward representing 
information relating to content, and not the content itself. The MPEG-7 standard seeks to 
provide a rich set of standardized tools for describing multimedia content, with the 
objective of providing a single standard for creating interoperable, simple and flexible 
solutions for indexing, searching and retrieving multimedia content. 

[0006] More specifically, MPEG-7 defines and standardizes a core set of 
"descriptors" for describing the various features of multimedia content; "description 
schemes" for describing relationships among the descriptors, the descriptors and other 
description schemes, and among description schemes; and a "description definition 
language" (DDL) for defining the description schemes and descriptors. The descriptions 
and description schemes for a particular type of multimedia content are encoded into a 
DDL-based schema. Each descriptor entry in the schema specifies the syntax and 
semantics of the corresponding feature. Each description scheme entry in the schema 
specifies the structure and semantics of the relationships among its children components. 

[0007] For example, a standard movie includes scenes, shots within scenes, 
titles for scenes, and time, color, shape, motion, and audio for shots. The corresponding 
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schema would contain descriptors that describe the features of the content, such as color, 
shape, motion, audio, title, etc., and one or more description schemes, e.g., a shot 
description scheme that relates the features of a shot, and a scene description scheme that 
relates the different shots in a scene and relates the title of the scene to the shots. 

[0008] A description scheme can be represented by a graph in which each 
descriptor and nested description scheme corresponds to a node, with the relationships 
among the descriptors and description schemes corresponding to the edges of the graph. 
Working with the graphical representations instead of the description schemes themselves 



O 
□ 

^ introduces an additional level of abstraction that can make searching for, and modiflcation 



of, description schemes more efficient. However, the graphs are generally complex and 
^ creating valid and efficient graphs will be difficult for most users. 

s 

O 

m 

n I SUMMARY OF THE INVENTION 

N [0009] Graphs representing multimedia content are validated and modified 

through graph matching and graph rewriting techniques using pre-defined model graphs. 
In one aspect, an input graph is validated against a model template graph. In another 
aspect, suggestions for modifying an input graph are provided based on model alphabet, 
rule and morphism graphs. The resulting graphs may represent queries against existing 
description schemes or new description schemes. 

[0010] The present invention describes systems, clients, servers, methods, and 
computer-readable media of varying scope. In addition to the aspects and advantages of 
the present invention described in this sunmiary, further aspects and advantages of the 
invention will become apparent by reference to the drawings and by reading the detailed 
description that follows. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
[0011] Figure 1 is a diagram illustrating a overview of the operation of an 
embodiment of a graphical rewriting system according to the invention; 

Figure 2 is a flow diagram of a validation method performed by the graphical 
rewriting system of Figure 1; 

Figure 3A illustrates one embodiment of a graph matching technique employed by 
the validation method of Figure 2; 

Figure 3B illustrates a mapping between graphs according to one embodiment of 
the invention; 

Figure 4 is a flow diagram of a modification method performed by the graphical 
rewriting system of Figure 1 ; 

Figures 5 A-D illustrate pushout rewriting techniques employed by the 
modification method of Figure 4; 

Figures 6A-E illustrate puUback rewriting techniques employed by the 
modification method of Figure 4; 

Figure 7 is an example of node replacement by the graphical rewriting system of 
Figure 1 ; and 

Figure 8 is a diagram of a computer environment suitable for practicing the 
invention. 

DETAILED DESCRIPTION OF THE INVENTION 
[0012] In the following detailed description of embodiments of the invention, 
reference is made to the accompanying drawings in which like references indicate similar 
elements, and in which is shown by way of illustration specific embodiments in which the 
invention may be practiced. These embodiments are described in sufficient detail to 
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enable those skilled in the art to practice the invention, and it is to be understood that 
other embodiments may be utilized and that logical, mechanical, electrical and other 
changes may be made without departing from the scope of the present invention. The 
following detailed description is, therefore, not to be taken in a limiting sense, and the 
scope of the present invention is defined only by the appended claims. 

[0013] Beginning with an overview of the operation of the invention. Figure 1 
illustrates one embodiment of a graph rewriting system 101 that validates input graphs 
107 and also suggests changes to an input graph 107 to aid a user lOS in creating a graph 
109 to search or modify a description scheme for multimedia content. A list of model 
graphs, referred to as graphical classification schemes (GCS) 103, constrain the nodes and 
the arrangement of edges for the description schemes. The model graphs in the GCS 103 
are used to validate input graphs 107 that represent user searches (queries) for, or 
modifications to, an existing description scheme. Because of the difficulty in creating a 
graph that represents a complex query or a description scheme, the user 105 may not be 
able to construct an valid and efficient input graph without assistance. The model graphs 
in the GCS 103 also can be used to generate suggested changes to a sample input graph 
that will result in the desired graph 109. 

[0014] Validation of an input graph 107 is accomplished using graph matching 
techniques. Correspondences are created between the input graph 107 and model 
template graphs in the GCS 103. If all parts of the input graph 107 can be built from the 
template graphs, the graph is valid and the query can be run or the modified description 
scheme can added to the GCS 103 for subsequent use. 

[0015] When the user 105 wants to modify a sample input graph, such as 
graph 107, the sample input graph is matched against model alphabet graphs in the GCS 
103. If a match is found, rules associated with the matching alphabet graph may be 
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applied to the input graph 107 to assist the formulation of a more complex graph 109. 
The rules are expressed in terms of model rule and morphism graphs stored in the GCS 
103, and the input graph is revised accordingly through graph rewriting techniques. The 
revised input graph is matched against the alphabet graphs and the process repeated until 
the user is satisfied with the resulting graph. 

[0016] The graph rewriting system 101 may be implemented on a stand-alone 
user computer or may be implemented on a server or servers coupled to the user through a 
network. One embodiment of an applicable computer environment is illustrated in Figure 
8 and described further below. 

[0017] The graph rewriting system 101 uses graph matching in validating an 
input graph and in rewriting a graph. The input graph 107 consists of a node list and an 
edge list. The node list may include node types and the edge list may include edge labels. 
The node list may be optional as it can be derived from the edge list in some cases. In one 
embodiment, an input graph is compared against model GCS graphs node by node and 
edge by edge using node types and edge labels. A starting, single edge in the input graph 
is matched by checking the type or instantiations of the nodes at each end of each edge, 
along with the label of the edge itself, against the model graphs. Generally, a graph is 
matched to template graphs when validating an input graph and to alphabet graphs when 
rewriting an input graph. A template graph acts as a template for the formation of 
descriptions that fit a particular model, and thus can be used to verify that a particular 
graph is of a particular template type. An alphabet graph specifies how an input graph is 
to be interpreted during the rewriting process. 

[0018] A validation method 200 that can executed by a computer to compare 
input and template graphs is illustrated in Figure 2. A starting edge is selected at block 
201. The type of the source node for the edge is obtained at block 203 and the type of the 

080398.503 -7- 



target node at block 205. A lexicographical search is performed at block 207 to find the 
group of valid edges in the template graphs that match the source and target nodes, i.e., 
the subgraphs formed by the source and target nodes. If the current edge appears in the 
group (block 209), the edge is valid and is marked (block 211). Another unmarked edge 
is selected that has the current target node as its source node (block 217) and the 
validation method 200 loops back to block 205. The method 200 continues until every 
edge is marked (block 213), meaning the input graph is valid, or until the node list is 
exhausted (block 215) or an edge is not valid (block 209), meaning the input graph is 
invalid. 

[0019] An example of node and edge matching is shown in Figure 3 A in 
which correspondences between the nodes and edges of an input graph 303 and a template 
graph 305 are illustrated by dashed lines. Node a 307 in input graph 303 corresponds to 
node X 319 in template graph 305. Nodes b 309 and c 31 1 correspond to node y 321. 
Edges d 313 and f 317 correspond to edge z 325 and edge e 315 corresponds to edge w 
323. Note that the endpoints of the edges d 313, e 315, and f 317 in the input graph 303 
also correspond to the appropriate endpoints of edges w 323 and z 325 in the template 
graph 305. Thus, the input graph 303 is valid because it can be built from the template 
graph 305. 

[0020] Because exact matching may prove inefficient for large graphs, 
approximation techniques may be used to match the input and template graphs. To 
simplify the process further, the labels on the edges are initially discarded. In one 
embodiment, the input and template graphs are matched using matrix functions on 
"adjacency matrices" that represent the graphs. For any graph, the corresponding 
adjacency matrix a contains rows and colunms labeled for the nodes of the graph. If two 
nodes, / and 7, have no edge connecting them, the value of is zero, otherwise the value 
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is a positive number. Commonly, a value of one is used to designate that the node pair 
forms the endpoints of an edge. Other values may be used to denote the length of the 
edge. Undirected graphs have synmietrical matrices. 

[0021] As an example, the adjacency matrix for the input graph 303 and the 
template graph 305 of Figure 3 A are shown in Tables 1-A and 1-B, respectively. Note 
that edge w 323 is a directed edge having node y 321 as both its endpoints, which causes 
the adjacency matrix for the template graph 305 to be asynmietric. 
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1 


c 
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1 


0 
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0 
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Y 


1 


1 



Table l-A 



Table 1-B 



[0022] The matching between the adjacency matrices may be accomplished 
through several techniques. In one embodiment, a permutation matrix is created that, 
when multiplied by the graph adjacency matrix for the input graph, gives the best match 
with a graph adjacency matrix for a template graph. The match is then the best match 
between the input graph and the template graph, using the permutation that brings them 
closest together. In another embodiment, a "least squares" algorithm is used to fmd the 
best match. Once the best match has been determined, the edge labels are checked as in 
the simplistic approach to finish the validation. 

[0023] What constitutes a best match may be determined by measuring the 
distances among the matching points in the adjacency matrices. Different distance 
measures are used for different types of matches. For example, if the entire input graph is 
matched against the template graphs, the best match is based on global distance measured 
among the elements of the adjacency matrices. 
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[0024] Because large, complex graphs are composed of a series of more 
simplistic subgraphs, adjacency matrices for the subgraphs of the input graph may be 
matched against adjacency matrices for the subgraphs of the template graphs using closest 
point matches. In still another embodiment, singular valid decomposition (SVD) 
transformations are applied to the adjacency matrices to create less sparse matrix 
approximations that can be more efficiently tested for matches. 

[0025] Graph rewriting is the process of combining graphs, replacing nodes 
with graphs, replacing edges with more complex configurations, and generating series of 
"production steps" to move from one graph to another. A modification method 400 that 
can be executed by a computer to rewrite a graph is illustrated in Figure 4. Graph 
matching as described above is used to determine matches between the working graph 
(initially the input graph) and the alphabet graphs. If there are any matches (block 401), 
the method 400 performs a rewrite graph loop for each matching alphabet graph starting 
at block 403 and ending at block 413. The rule graph associated with the alphabet graph 
is presented to the user (block 405) and, if approved (block 407), is applied to the working 
graph (block 409) using an algebraic graph granmiar as described further below to create a 
new working graph. In an alternate embodiment, the graph gran^nar processing 
represented by block 409 reduces each graph to text strings, and rewrites the text strings 
to the new working graph. Because description schemes are typically directed acyclic 
graphs (forests of trees) and attempting to map a text string back to an acyclic graph often 
results in infinite trees, this alternate embodiment requires additional processing at block 
409 to avoid such infinite loops. 

[0026] If the approved rule graph requires new data, the method 400 obtains 
the data from the user (block 41 1). If a mle graph is not approved (block 407), the 
method 400 continues the rewrite graph loop with the next rule graph. 
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[0027] The working graph created by the rewrite graph loop can be further 
modified, if desired (block 415), by returning to block 401 to find further applicable 
alphabet graphs. If no further changes are desired at block 415, or if no alphabet graphs 
are found in the graph matching process at block 401, the modification method 400 exits. 
The graph resulting from the modification method 400 can be applied or stored, as 
desired. 

[0028] Algebraic graph granmiars are based on category theory, in which the 
nodes and edges of a graph, or whole graphs, are treated as objects, and functions, 
relations, or mappings, called morphisms, are defined that map from the objects in a 
domain graph to the like objects in an image graph. Returning now to Figure 3A, the 
correspondences between the nodes and edges in the input (domain) graph 303 and the 
nodes and edges in the template (image) graph 305 create a morphism F 301 as illustrated 
in Figure 3B. Morphism F 301 can be represented as a morphism graph in which the 
morphism nodes are the nodes and edges of the domain 303 and image 305, and 
morphism edges are the correspondences from the domain graph 303 to the image graph 
305. 

[0029] Two particular operations in algebraic graph granmiars are suitable to 
build graph rewriting techniques for description scheme graphs: pushout and pullback. 
Pushouts and puUbacks can be thought of as sums and products, respectively. A pushout 
takes the elements of a conmion piece between two graphs and uses those elements to 
glue the graphs together. A variation, called a double pushout operation (DPO), performs 
a cut and paste operation on the graphs. A pullback takes the elements of two graphs that 
map to elements in a third graph and computes all the combinations of these elements. 
Given an input graph and the appropriate alphabet, rule and morphism graphs, there is a 
pullback operation equivalent to each double pushout, but there are some operations, 
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involving the creation of graph elements for which there is only a puUback operation, no 
double pushout. 

[0030] An exemplary pushout operation 500 is shown in Figure 5A. 
Morphisms f 509 and g 51 1 are used to map from a graph A 501 to a graph B 503 and a 
graph C 505, respectively. To complete the pushout rule 500, morphisms h 513 and j 515 
are found to join the graph B 503 and the graph C 505 together along their conmion 
portions defined by the graph A 501. In an example shown in Figure 5B, a pushout 
operation is applied to input graph 525 through an alphabet graph 521. Rule graph 523 is 
associated with the alphabet graph 521 and is thus '"glued" onto the input graph 525 at the 
common portion defined by the alphabet graph 521 to create output graph 527. Referring 
back to Figure 5 A, in one embodiment, the rule graph B 503, the alphabet graph A 501, 
and the morphism graph for morphism h 513 are stored in the graphical classification 
scheme, while the morphism j 515 is part of the instantiation or application of the 
production rule. 

[0031] Figure 5C illustrates a double pushout operation 530. Given a starting 
graph L 533, K 531 is the context part in which the nodes and edges will be preserved in 
the result graph R 535. A morphism 1 537 and morphism r 539 are used to map from K to 
L and k to R, respectively, and form a production rale. To apply the double pushout to an 
input graph G 541, a graph morphism m 547 is found that maps from graphs L 533 to G 
541, with the restriction that no element in G 541 is the target of two elements in L 533, 
one of which is in the image of K 531 and the other is not. Assuming such a morphism m 
547 can be found, then a morphism d 549 can be inferred that maps from K 531 to a graph 
D 543, completing the first pushout square as previously illustrated in Figure 5B. Once D 
543 is created, a morphism r* 551 is found that maps from D 543 to the output graph H 
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545, completing the second pushout square, which applies the rule 530 to the input graph 
541 to produce the desired output graph 545. 

[0032] The application of the double pushout operation 530 can be viewed as 
a cut and paste process as illustrated in Figure 5D using the input and resulting graphs 
from Figure 1. The production rule specifies that starting graph 563 has everything cut 
away except the common part defined by graph 561. The part of graph 565 that interfaces 
with graph 561 is pasted onto the remaining portion of graph 563 using morphism R 569. 
When the production rule is applied to input graph 571, morphism M 577 specifies which 
parts of the input graph 571 will be discarded and which parts will be retained in the 
resulting graph 575. The smallest mapping M* 585 of graph 565 to the retained portion 
of graph 571 is pasted onto the retained portion of graph 571 to produce the output graph 
575. Referring again to Figure 5C, in one embodiment, the graphs L 533, K 531, and R 
535 are stored in the graphical classification scheme, together with the morphism graphs 
for morphisms 1 537 and r 539. Graphs L 533 and R 535 are stored as rule graphs and K 
531 as an alphabet graph. The morphism m is instantiated as necessary to apply the 
production rule to the input graph G 541. 

[0033] A variation of the DPO creates a pushout cube from the original rule, 
which creates a new implied rule. It is from this new rule that the result of the production 
step is inferred. The DPO essentially deletes old material, and then appends new material. 
The DPO variation appends the new material first, then deletes the old material. If a node 
is deleted in a tree, everything below it disappears. Because the pushout cube variation 
appends first, it avoids resending of old data. 

[0034] PuUback grammars have considerable advantages over pushout 
granmiars when used with infinite graphs. This is important for applications, such as 
description schemes, because pullback grammars are applicable to changes on a graph 
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that do not specify the entire start and flnishing graphs as is required by DPO. Often an 
infinite graph is a good model for a embedded graph when the context into which it is 
embedded is not known. In a pullback as illustrated in Figure 6 A, a final graph 601 is 
produced from an alphabet graph C 607, and two mappings (morphism f 609 and 
morphism g 61 1) from the starting graph A 603 and a rule graph B 605 to the alphabet 
graph C 607. The final graph 601 graph has as its nodes the product nodes of the graphs 
A 603 and B 605 (often equated with the nodes of one or the other), and as edges the 

{=A product edges of the two graphs, as a result of the fibered product of the two graphs. 

O 

^ [0035] A fibered product is a subset of a tensor product. Assuming the nodes 

J and edges in graph A 603 are represented as a set of elements { ai ,a2, . . . ,an } and the nodes 

01 

^ and edges of graph B 605 as a set of elements {bi,b2,. - .bk}, the tensor product A®B is 

s 

H defined by attaching a copy of each element in B to each element in A, thereby producing 

G 

1=1 {aibi,aib2,...,aibk} 
{a2bi,a2b2,...,a2bk} 

{anbi,anb2,...,anbk}. 

The tensor product may be thought of as a set of fibers (vectors, in this case) over the set 

A 603, in which each fiber is in one-to-one correspondence to B 605, and corresponds to 

one value in A 603, i.e. 

ai{bi,b2,...,bk} 
a2{bi,b2,...,bk} 

aii{bi,b2,...,bk} 

where there is an operation called a projection such that p(aibj) = aj. The fibered product 
601 consists of only those pairs of product elements in the tensor product such that f(ai) = 
g(bj). The net effect is that the fibered product 601 can be viewed as fibers from B 605 
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over elements of A 603, where the base in A 603 is the set of elements from C 607 pulled 
back to A 603 by morphism f 609. Such a "pulling back" operation is accomplished by 
generating, for each element in C 607, the set of elements in A 603 which have this 
particular element as their image. This essentially "inverts" the morphism f, and is called 
the pullback ofC along f. Likewise from the point of view of B 605, the fibers are made 
up of those elements of A 603 that are pulled back from C 607 along morphism f 609, 
over a base that is pulled back to B 60S from C 607 along morphism g 61 1. Thus, the 
fibered product 601 consists of all pairs {aibj} with f(ai)= Ck = g(bj) for some Ck in C 607. 

[0036] Figure 6B illustrates a pullback operation using the input and resulting 
graphs from Figure 1. The alphabet graph 627 specifies how the input graph 623 will be 
interpreted. The morphisms F 629 and G 631 map objects from the input graph 623 and 
rule graph 625, respectively, to three parts of the alphabet graph 627. All parts of the input 
graph 623 which are not affected by the rewriting operation are mapped to a context part 
633. The new material on the rule graph 625, and the node to be modified from the input 
graph 623, are mapped onto an unknown node 637. The unknown node 637 is a variable, 
represented by a loop, so that it will match any valid rule for its node type and edge label. 
A neighborhood, or interface part 635, represents all of the possible nodes connected to 
the unknown node 637. Typically, the rule graph 625 consists of the context 633 and 
interface 635 of the alphabet graph 627, otherwise modifications may occur in places 
other than the unknown node 637. 

[0037] In practice, the resulting graph 621 contains an object (node or edge) 
from the input graph 623 if that object maps to an object in the alphabet graph 627 for 
which there is also an object mapped from the rule graph 625. When the unknown part 
637 of the alphabet graph 627 is a node, as illustrated in Figure 6B, the mapping from the 
input node and rule node is called a filter. If the inverse image of the unknown node on 
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the alphabet graph under the chosen filter is a single node on the input graph, the pullback 
operation rewrites that node with the corresponding portion of the rule graph, usually 
replacing it \yith a more complex graph structure. In general, if the maps to the alphabet 
graph are not filters, then multiple nodes may be rewritten, and a whole section of the 
input graph changes. 

[0038] Thus, the edges that exist in the resulting graph are precisely those in 
the rule and input graphs that have something in common: they are mapped to the same 
basic type of edge in the alphabet graph. Indeed, if there is no mapping of one or the 
other to some edge in common, there will be no such relation between nodes in the 
resulting graph. This enforces the rule that if a node is changed in such a way that some 
of its old relations are no longer valid, the new graph will not have these relations present. 
Likewise, if there is only a partial match of the nodes of the input with those of the rule, 
then those non-matching nodes of the rule are not transposed onto the output. This 
introduces a new form of restriction, loosely formulated as follows; The only relations 
that are allowed between a pair of entities that are nodes of a graph are those which may 
be mapped to edges between the nodes these entities map to on the appropriate alphabet 
graph. 

[0039] A double pullback (DPB) performs a cut and paste operation as 
illustrated in Figures 6D and 6E. In Figure 6D, the top pullback square collapses part of 
an input graph 681 into a node in graph A 683, leaving the remainder of the input graph 
681 unchanged. The bottom pullback glues those portions of graph D 693 that map to the 
unknown of graph C 687 (through morphism h 695) onto the collapsed node in graph A 
683 to create an output graph 697 in the same fashion as described above for the single 
pullback. Thus in Figure 6E, input graph 621 is partially collapsed into graph A 623 and 
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the nodes and edges of graph 696 that map to unknown 637 in graph 627 are mapped to 
the collapsed node in graph A 623, resulting in output graph 698. 

[0040] Adjacency matrices may be used in rewriting graphs as well as in graph 
matching. In an example illustrated in Figure 6C, the pullback operation of Figure 6A is 
performed on the graphs and morphism from Figure 3B. Input graph 643 and alphabet 
graph 647 correspond to graphs A and C in the pullback operation of Figure 6A, and 
morphism F 301 corresponds to morphism f. Rule graph 649 (corresponding to graph B) 
contains nodes 1 651, 3 653 and 2 655 that are connected through edges 5 659 and 6 657. 
The rule graph 649 is mapped to the alphabet graph 647 using morphism G 661. The 
tensor product of the graphs A 643 and B 649 is a set of nodes of the form ab, together 
with a set of edges ele2, where el is an edge from A 643, and e2 is an edge from B 649. 
The adjacency matrix of A 643 is represented in Table 1-B above. A similar matrix can be 
written for B as shown in Table 2. The tensor product of the two matrices is matrix in 
which each entry a in the adjacency matrix for A 643 is replaced by a subblock ab of 
entries starting with a and ending with each element from B 649. Since A has a 3x3 
adjacency matrix, and B has a 3x3 adjacency matrix, the new matrix will be This is 
known as a Kronecker or tensor product of two matrices, and is widely used. 
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1 
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0 



Table 2 



[0041] The fibered product represented by graph 641, which is the output of 
the product of the input graph 643 and the rule graph 649 over the alphabet graph 647, has 
as its nodes only the nodes in the tensor product whose a and b entries are mapped to the 
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same place on the alphabet graph. The fibered product has as its edges, likewise, only 
those edges mapped to the same edge in the alphabet. 

[0042] However, the tensor product and matrix operations are very intensive 
and generate too much graph. Like the alphabet and rule graphs, the input graph can be 
divided into context, interface, and unknown parts. Since the context part is unaffected, 
and, algorithmically, is unimportant in rewriting, in an alternate embodiment only the 
interface nodes and edges, and the unknowns are used to constmct adjacency matrices. 
The parts of the input and rule graphs that map to the unknown and interface of the 
alphabet graph are referred to as "pre-images." Assuming f and g are partial morphisms 
for morphism F 301 and G 661 that include only the interface and unknown parts, and v 
and e represent nodes and edges respectively, each of these functions, fv,fe (and their 
counterparts) defines a set of arrows between the objects (nodes or edges) in the input 
(rule) graph and the corresponding nodes (edges) on the alphabet graph. These objects 
and arrows define a bipartite graph that can be written as adjacency matrix, with the 
sources of fv(fe) as rows and the targets as columns. The matrix for Afv (Afe) is shown in 
Table 3-A and its counterpart Bgv(Bge) in Table 3-B. The matrix Afv * BgvT has a 1 
wherever the corresponding pair form a product node. Likewise the matrix Afe * BgeT 
contains a 1 for each edge. The product matrix can be read off the two resultants, 
producing the resulting graph 641 that contains nodes al 679, b3 663, b2 665, c3 667 and 
c2 669, and edges f6 671, f5, 673, d6 675 and d5 677. 
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Table 3-A Table 3-8 
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[0043] An example of node replacement is illustrated in Figure 7. The input 
graph 700 is a starting graph for a user that wants to find multimedia content that 
describes how to determine when a piece of fruit is ripe. The input graph 700 contains 
three nodes, fruit 701, ripe 703, and parameterization 705. During the graph rewriting 
process 707, the input graph 700 is matched with an alphabet graph in the GCS that 
parameterizes the same stmcture. Applying the associated rule graph to the input graph 
700 results in graph 710 in which the parameterization node 705 has been replaced by a 

^ more complex structure that relates perceived color to the ripeness of a piece of fruit. The 

p 

^ resulting graph 710 can now be used to search for content that describes the ripe colors of 
various types of fruit. The new structure is interpreted as a parameterization of the 

m 

K relationship between the fmit and the property "ripe". In MPEG-7 practice, the parameter 

^ values vary from zero to one, but they may take any range. This value is interpreted as the 

ffl value of the membership of "fruit" and "ripe" in the relation "property of. A relation is 

Q 

id formally a set, and this membership function is a measure of fiizzy set membership. It is a 

function of color 715, in this example, so it is embodied by a Semantic State called 
"Member function" 713. As the color of the fruit, described in the color histogram 711, 
varies from, for example, green to brown, the membership value varies from zero to one, 
indicating that the fruit is most ripe when it is brown, and least ripe when it is green. 
Another example could have applied another eligible rule, say one which parameterized 
the ripeness of fruit by a membership function and a "softness" value. 

[0044] MPEG-7 defines descriptors and description schemes in terms of XML 
(extensible markup language) elements. The following exemplary alphabet graph has an 
unknown node that allows objects or events, SemanticPlace or SemanticTime, in 
configurations where the object or event is located at the place and/or the object/event 
exists for a given duration. The alphabet graph is constructed as a single node, meaning 
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that there is an implicit context node, and that relations between the unknown and the 

context are unrestricted. The implicit context node takes any type, has any links, and is 

linked to the unknown node by all edge relations. 

<Graph name="Place and Duration" > 
<Graph id="unknownr' /> 

<Node id="object'7> 

<Node id="event'7> 

<Node id=:"SemanticPlace"/> 

<Node id="SemanticTime'V> 

<Relation ObjectSemanticPlace /> 

<Relation ObjectSemanticTime /> 

<Relation EventSemanticPlace /> 

<Relation EventSemanticTime /> 
</Graph> 
</Graph> 

[0045] The associated exemplary mle graph adds SemanticPlace to an Object 

and allows any object to have a SemanticPlace attached to it, with a relation of 

"LocationOf Note that, as it is written, there is no restriction to the number of such 

SemanticPlaces that are added to the original object, so the mle may be applied an 

unlimited number of times. 

<Graph> 

<Node id="SemanticPlace" /> 
<Node id="Object" /> 

<Relation source="SemanticPlace" target="Object" type= 'locationOf ' /> 
<ControlledTerm ClassificationSchemeLocation=. . . 

TermIdentifier="Place and Duration" /> 
<Relation source="Object" target="unknownl" type="ruleMorphism" /> 
<Relation source="LocationOf ' target="unknownr' 
type="mleMorphism'V> 

</Graph> 

[0046] An unknown as in the above example may be frequently used, so it can 
be built of smaller "words" and a classification scheme of unknowns may be stored for 
subsequent use. The unknown node is a "match objects, events, place and duration, with 
allowed edges between place and object, between duration and object, and likewise for 
events" node. It can be used for elements of an alphabet graph other than the unknown. 
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As interface or context, it provides restrictions on what may be attached to the unknown 
when the rule is applied. Multiple object and event nodes may be mapped to the unknown 
on a single operation (operations in parallel), but in this case, they need to be disjoint 
because there are no relations between the objects and events in the alphabet. All 
connections between them must go through the context. In an alternate embodiment, the 
adjacency matrix is also stored by assigning numerical values to all the edges and nodes in 
the alphabet and rule graphs in the order in which the graph is written. It will be 
appreciated that context need not be implicit as in the present example, but may be 
explicitly added to restrict what kind of relations or datatypes may occur in the rest of the 
graph. In one embodiment, since there is only one context in an alphabet graph its 
identifier and name is "context." 

[0047] While the above example uses object and event as abstractions and 
provides very general rules (in fact, giving rules for datatypes), the invention is not so 
limited. For example, the following alphabet and rule graphs allow any male and female 
to become married. 

<Graph id="interface" > 

<Node id="female'V> 

<Node id="male" /> 
</Graph> 

</Graph id="unknown" > 

<Node Event="ceremony" /> 

<Node Event="courtroomDance" /> 
</Graph> 

<Graph> 

<Node id="Bride'' /> 
<Nodeid="Groom"/> 
<Node Event=marriage /> 

<Relation source="Bride" target="marriage" type=participatesln /> 
<Relation source="Groom" target="marriage" type="participatesln" /> 
<:/Graph> 

<ControlledTerm schemeLocation="someURr' scheme=*'someScheme" 
term="someAlphabetGraph" /> 

<Graph> 
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<Relation source="bride" target="unknown" type='*ruleMorphism*' /> 
<Relation source="groom" target="unknown" type="ruleMorphism" /> 
<Relation source="marriage" target='*unknown" type="nileMorphism" /> 
<Relation source="participatesIn" target="unknown" 
type="ruleMorphism" /> 

</Graph> 

[0048] The various graphs used by the present invention are nested within a 
graphical classification scheme description scheme (GCS DS), which is defined as 
follows: 

<complexType name="GraphicalClassificationScheme" 
<complexContent> 
<ex tension base="mpeg7: ClassificationSchemeType"> 
<attribute name="strict" type="boolean" use="required" default="false" /> 
</extension> 
</complexContent> 
</complexType> 

This scheme allows the setting of the attribute "strict" to require that all correspondences 

to the alphabets in this scheme are to be "equal to," corresponding to an exact match. 

Other attributes may be specified to implement inexact matches. 

[0049] An optional "name" field is added to a graph type descriptor scheme 

entry in the GCS DS to specify the type of field which is represented by a particular node. 

Therefore a "name" field is added for this purpose. The name of this field is consistent 

with the field which gives the data type of the corresponding Relation. 

<complexType name="GraphType"> 
<complexContent> 

<extension base="mpeg7:DSType"> 

<choice minOccurs="0" maxOccurs="unbounded"> 
<element name="Node"> 
<complexType> 

<complexContent> 

<ex tension base="mpeg7 :ReferenceType"> 
<attribute name="id" type="ID" use="optional"/> 
<attribute name="name" type="string" use "optionar7> 
</extension> 
</complexContent> 
</complexType> 
</element> 
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<element name="Relation" type="mpeg7:RelationBaseType'7> 
</choice> 
</extension> 
</complexContent> 
</complexType> 

[0050] The template, alphabet, rule and morphism graphs are defined in a 
Graphical Term Definition descriptor that appears as an entry in the GCS DS. An 
"alphabetOraphRef ' element is used to specify an alphabet graph that is the target of a 
graph morphism. In addition, the different graph grammars for a particular rule are 
specified through an optional attribute "production." Valid values for the production 
attribute are PB (puUback), PO (pushout), DPB (double puUback), and DPO (double 
pushout). 

<complexType name="GraphicalTermDefinitionType"> 
<complexContent> 

<extension base="mpeg7:TermDefinitionBaseType"> 
<choice minOccurs="r' maxOccurs="unbounded"> 

<element name="templateGraph'' type="mpeg7:GraphType'V> 
<element name="alphabetGraph" type="mpeg7:GraphType"/> 
<element name="mleGraph" type="mpeg7:GraphType'7> 
<element name="alphabetGraphRef ' 

type="mpeg7 :TermUseType' V> 
<element name="morphismGraph" type="mpeg7:GraphType'7> 
</choice> 

<attribute name="termld" type="NMTOKEN'* use="required'7> 
<attribute name="production" type="string" use="optional'7> 
</extension> 
</complexContent> 
</complexType> 

[0051] The morphism graph is derived from a standard graph type and 
includes elements defining the source graph of the morphism ("SourceGraphRef '), a term 
reference to the source graph ("SourceGraphTermRef a reference to the target graph of 
the morphism ('TargetGraphRef and a term reference to the target graph 

CTargetGraphTermRef*). 

<complexType name="MorphismGraphType*'> 
<complexContent> 
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<extension base="mpeg7 :GraphType"> 
<choice minOccurs="r'> 

<element name="SourceGraphRef ' type="mpeg7:Reference"/> 
<element naine="SourceGraphTermRef ' 
type="nipeg7 :TermUseType'7> 

</choice> 

<choice minOccurs="l"> 

<element name="TargetGraphRef ' type="mpeg7:Reference"/> 
<element name='TargetGraphTerniRef ' 
type="mpeg7:TermUseType'7> 

</choice> 
</extension> 
</complexContent> 
</complexType> 

[0052] One embodiment of a computer system suitable for use as the graphical 
rewriting system 101 is illustrated in Figure 8. The computer system 40, includes a 
processor 50, memory 55 and input/output capability 60 coupled to a system bus 65. The 
memory 55 is configured to store instructions which, when executed by the processor 50, 
perform the methods described herein. The memory 55 may also store the multimedia 
content and descriptions. Input/output 60 provides for the delivery and display of 
multimedia content or portions or representations thereof. Input/output 60 also 
encompasses various types of computer-readable media, including any type of storage 
device that is accessible by the processor 50. One of skill in the art will immediately 
recognize that the term "computer-readable medium/media" further encompasses a carrier 
wave that encodes a data signal. It will also be appreciated that the computer system 40 is 
controlled by operating system software executing in memory 55. Input/output and 
related media 60 store the computer-executable instructions for the operating system and 
methods of the present invention as well as the multimedia content. It will be further 
appreciated that input/output 60 may include a network interface that connects the system 
40 to a conmiunications network, such as the Internet. 
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• 4 

[0053] The description of Figure 8 is intended to provide an overview of 
computer hardware and other operating components suitable for implementing the 
invention, but is not intended to limit the applicable environments. It will be appreciated 
that the computer system 40 is one example of many possible computer systems which 
have different architectures. A typical computer system will usually include at least a 
processor, memory, and a bus coupling the memory to the processor. One of skill in the 
art will inmiediately appreciate that the invention can be practiced with other computer 
system configurations, including multiprocessor systems, minicomputers, mainframe 
computers, and the like. The invention can also be practiced in distributed computing 
environments where tasks are performed by remote processing devices that are linked 
through a conmiunications network. 

[0054] The particular methods of the invention are described in terms of 
computer software with reference to a series of flow diagreuns in Figures 2 and 4. The 
methods constitute computer programs made up of computer-executable instructions 
illustrated as blocks (acts) 201 until 217 in Figure 2 and blocks 401 until 415 in Figure 4. 
Describing the methods by reference to a flow diagram enables one skilled in the art to 
develop such programs including such instructions to carry out the methods on suitably 
configured computers (the processor of the computer executing the instructions from 
computer-readable media, including memory). The computer-executable instructions may 
be written in a computer programming language or may be embodied in firmware logic. If 
written in a programming language conforming to a recognized standard, such 
instructions can be executed on a variety of hardware platforms and for interface to a 
variety of operating systems. In addition, the present invention is not described with 
reference to any particular programming language. It will be appreciated that a variety of 
progranmiing languages may be used to implement the teachings of the invention as 

080398.503 -25- 



described herein. Furthermore, it is common in the art to speak of software, in one form 
or another (e.g., program, procedure, process, application, module, logic...), as taking an 
action or causing a result. Such expressions are merely a shorthand way of saying that 
execution of the software by a computer causes the processor of the computer to perform 
an action or a produce a result. It will be appreciated that more or fewer processes may be 
incorporated into the methods illustrated in Figures 2 and 4 without departing from the 
scope of the invention and that no particular order is implied by the arrangement of blocks 
shown and described herein. 

[0055] Although specific embodiments of a graph rewriting system have been 
illustrated and described herein, it will be appreciated by those of ordinary skill in the art 
that any arrangement which is calculated to achieve the same purpose may be substituted 
for the specific embodiments shown. This application is intended to cover any 
adaptations or variations of the present invention. 
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